Hindi Subjective Lexicon : A Lexical Resource for Hindi Polarity Classification
نویسندگان
چکیده
With recent developments in web technologies, percentage web content in Hindi is growing up at a lighting speed. This information can prove to be very useful for researchers, governments and organization to learn what’s on public mind, to make sound decisions. In this paper, we present a graph based wordnet expansion method to generate a full (adjective and adverb) subjective lexicon. We used synonym and antonym relations to expand the initial seed lexicon. We show three different evaluation strategies to validate the lexicon. We achieve 70.4% agreement with human annotators and ∼79% accuracy on product review classification. Main contribution of our work 1) Developing a lexicon of adjectives and adverbs with polarity scores using Hindi Wordnet. 2) Developing an annotated corpora of Hindi Product Reviews.
منابع مشابه
A Fall-back Strategy for Sentiment Analysis in Hindi: a Case Study
Sentiment Analysis (SA) research has gained tremendous momentum in recent times. However, there has been little work in this area for an Indian language. We propose in this paper a fall-back strategy to do sentiment analysis for Hindi documents, a problem on which, to the best of our knowledge, no work has been done until now. (A) First of all, we study three approaches to perform SA in Hindi. ...
متن کاملAutomatic Classification of Hindi Verbs in Syntactic Perspective
We report of a rule based, knowledge-base driven tool to automatically classify Hindi verbs in syntactic perspective. We also report of developing the largest lexical resource for Hindi verbs along with the information on their class based on valency and some syntactic diagnostic tests as well as their morphological/inflectional type. We use this resource to develop the tool to automatically cl...
متن کاملA Sentiment Analyzer for Hindi Using Hindi Senti Lexicon
Supervised approaches have proved their significance in sentiment analysis task, but they are limited to the languages, which have sufficient amount of annotated corpus. Hindi is a language, which is spoken by 4.70% of the world population, but it lacks a sufficient amount of annotated corpus for natural language processing tasks such as Sentiment Analysis (SA). With the increase in demand and ...
متن کاملPhrase Pair Mappings for Hindi-English Statistical Machine Translation
In this paper, we present our work on the creation of lexical resources for the Machine Translation between English and Hindi. We describes the development of phrase pair mappings for our experiments and the comparative performance evaluation between different trained models on top of the baseline Statistical Machine Translation system. We focused on augmenting the parallel corpus with more voc...
متن کاملsAT k A Unified Computational Lexicon for Hindi-English Code-Switching
We investigate how lexicons of languages in contact are merged to generate a fused lexicon for the code-mixed variety. Using the HPSG formalism, we develop computational lexicons for Hindi and English, and explore how these can be merged to obtain a fused-lect lexicon. We consider the HindiEnglish Code Switching variety (HECS), a stable variety that has resulted from contact between these langu...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012